Probability Mass Exclusions and the Directed Components of Pointwise Mutual Information
نویسندگان
چکیده
The pointwise mutual information quantifies the mutual information between events x and y from random variable X and Y . This article considers the pointwise mutual information in a directed sense, examining precisely how an event y provides information about x via probability mass exclusions. Two distinct types of exclusions are identified—namely informative and misinformative exclusions. Then, inspired by Fano’s postulates for the pointwise mutual information, three postulates are proposed that aim to decompose the pointwise mutual information into a non-negative informational component associated with the informative exclusions, and a non-positive informational component associated with the misinformative exclusions. This leads to a novel derivation of a familiar decomposition of the pointwise mutual information into its entropic components. The paper concludes by discussing the relevance of considering information in terms of probability mass exclusions to the ongoing effort to decompose multivariate information.
منابع مشابه
MuSweeper: An Extensive Game for Collecting Mutual Exclusions
Mutual exclusions provide useful information for learning classes of concepts. We designed MuSweeper as a MineSweeper-like game to collect mutual exclusions from web users. Using the mechanism of an extensive game with Imperfect information, our experiments showed MuSweeper to collect mutual exclusions with high precision and efficiency.
متن کاملConndence Intervals on Likelihood Estimates for Estimating Association Strengths
Sparse data causes errors in the maximum-likelihood estimates of event probabilities that are often large enough to render measures of association such as pointwise mutual information useless for small sample sizes. This squib describes a procedure for estimating event probabilities that produces a conndence interval estimate rather than a point estimate. Using these conndence intervals in calc...
متن کاملA Probabilistic Model of Redundancy in Information Extraction
Unsupervised Information Extraction (UIE) is the task of extracting knowledge from text without using hand-tagged training examples. A fundamental problem for both UIE and supervised IE is assessing the probability that extracted information is correct. In massive corpora such as the Web, the same extraction is found repeatedly in different documents. How does this redundancy impact the probabi...
متن کاملThe Use of Web-based Statistics to Validate Information Extraction
The World Wide Web is a powerful and readily available text corpus that can be used effectively to validate the output of an information extraction system. We present experiments that explore how pointwise mutual information (PMI) from search engine hit counts can be used in an Assessor module that assigns a probability that an extracted fact or relationship is correct, thus boosting precision....
متن کاملPointwise Information Decomposition Using the Specificity and Ambiguity Lattices
What are the distinct ways in which a set of predictor variables can provide information about a target variable? When does a variable provide unique information, when do variables share redundant information, and when do variables combine to provide complementary information? The redundancy lattice from the partial information decomposition of Williams and Beer provided a promising glimpse at ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1801.09223 شماره
صفحات -
تاریخ انتشار 2018